Audio-visual Multiple Active Speaker Localisation in Reverberant Environments

نویسندگان

  • Zhao Li
  • Thorsten Herfet
  • Martin Grochulla
  • Thorsten Thormählen
چکیده

Localisation of multiple active speakers in natural environments with only two microphones is a challenging problem. Reverberation degrades the performance of speaker localisation based exclusively on directional cues. This paper presents an approach based on audio-visual fusion. The audio modality performs the multiple speaker localisation using the Skeleton method, energy weighting, and precedence effect filtering and weighting. The video modality performs the active speaker detection based on the analysis of the lip region of the detected speakers. The audio modality alone has problems with localisation accuracy, while the video modality alone has problems with false detections. The estimation results of both modalities are represented as probabilities in the azimuth domain. A Gaussian fusion method is proposed to combine the estimates in a late stage. As a consequence, the localisation accuracy and robustness compared to the audio/video modality alone is significantly increased. Experimental results in different scenarios confirmed the improved performance of the proposed method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Speaker Localisation and Tracking using Audio and Video

This thesis is concerned with the problem of tracking active speakers using audio and video data. Particular focus is placed on the task of tracking the current active speaker in a lecture room environment using multiple cameras and multiple microphones. A database of lecture recordings corresponding to this scenario from the European Integrated Project, Computers in the Human Interaction Loop ...

متن کامل

Algorithms for Audiovisual Speaker Localisation in Reverberant Acoustic Environments

Innovative and future human-machine interfaces or video conference systems require knowledge of the speaker’s position for automatic beamformerand camera-steering purposes. To determine this position, acoustical as well as visual localisation techniques can be applied, and the aim of this project was to develop suitable algorithms for such an audiovisual speaker localisation. Furthermore, an ex...

متن کامل

Speaker Localisation Using Audio-Visual Synchrony: An Empirical Study

This paper reviews definitions of audio-visual synchrony and examines their empirical behaviour on test sets up to 200 times larger than used by other authors. The results give new insights into the practical utility of existing synchrony definitions and justify application of audio-visual synchrony techniques to the problem of active speaker localisation in broadcast video. Performance is eval...

متن کامل

Separation of multiple concurrent speeches using audio-visual speaker localization and minimum variance beam-forming

Speaker segmentation is an important task in multi-party conversations. Overlapping speech poses a serious problem in segmenting audio into speaker turns. We propose an audio-visual speech separation system consisting of an array microphone with eight sensors and an omnidirectional color camera. Multiple concurrent speeches are segmented by fusing the two heterogeneous sensors. Each segmented s...

متن کامل

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation

Previously, a dereverberation method based on generalized spectral subtraction (GSS) using multi-channel least mean-squares (MCLMS) has been proposed. The results of speech recognition experiments showed that this method achieved a significant improvement over conventional methods. In this paper, we apply this method to distant-talking (far-field) speaker recognition. However, for far-field spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012